Discriminant wavelet basis construction for speech recognition

نویسندگان

  • Christopher John Long
  • Sekharajit Datta
چکیده

In this paper, a new feature extraction methodology based on Wavelet Transforms is examined, which unlike some conventional parameterisation techniques, is flexible enough to cope with the broadly differing characteristics of typical speech signals. A training phase is involved during which the final classifier is invoked to associate a cost function (a proxy for misclassification) with a given resolution. The sub spaces are then searched and pruned to provide a Wavelet Basis best suited to the classification problem. Comparative results are given illustrating some improvement over the Short-Time Fourier Transform using two differing subclasses of speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Wavelet Based Recognition Using Model Theory for Feature Selection

An increase in accuracy and reduction in computational complexity of the common wavelet-based target recognition techniques can be achieved by using interpretable features for recognition. In this work, the Best Discrimination Basis Algorithm (BDBA) is applied to select the most discriminant complete orthonormal wavelet basis for recognition purposes. The BDBA uses a relative entropy criterion ...

متن کامل

Application of Wavelet Packet Transform in Pattern Recognition of Near-IR Data

The wavelet packet transform is studied as a tool for improving pattern recognition based on near-infrared spectra. Application to the preprocessing of the spectra improves the classification when compared to using either the standard normal variate method or no pretreatment at all. Selecting features from a local discriminant basis instead of from a full decomposition does not improve the resu...

متن کامل

An analog VLSI architecture for auditory based feature extraction

We have developed a low power analog VLSI chip for real time signal processing motivated by the principles of human auditory system. A analog cochlear lter-bank (which is implemented on the chip) decomposes the input audio signal into several frequency bands that have almost equal bandwidth on a log scale. This step is thus similar to computing the wavelet transform. The chip then computes sign...

متن کامل

Speech Emotion Recognition Based on Deep Belief Networks and Wavelet Packet Cepstral Coefficients

A wavelet packet based adaptive filter-bank construction combined with Deep Belief Network(DBN) feature learning method is proposed for speech signal processing in this paper. On this basis, a set of acoustic features are extracted for speech emotion recognition, namely Coiflet Wavelet Packet Cepstral Coefficients (CWPCC). CWPCC extends the conventional MelFrequency Cepstral Coefficients (MFCC)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998